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Abstract 

We study the submodular ranking problem in the presence of metric costs. This is motivated by appli- 
cations in broadcast scheduling and web-search ranking. The input to the minimum latency submodular 
cover (MLSC) problem consists of a metric {V, d) with source r E V and m monotone submodular 
functions /i, /2, fm ■ 2^ [0, 1]. The goal is to find a path originating at r that minimizes the total 
cover time of all functions; the cover time of function f i is the smallest value t such that fi has value one 
on the vertices visited within distance t along the path. We give a polynomial time 0(log ^ • log^"^'' \V\)- 
approximation algorithm for this problem; here (5 > is any constant, and e is the minimum positive 
marginal increase of any function fi. When the underlying metric is uniform, we obtain the Ranking 
with Submodular Valuations problem, for which an 0(log i)-approximation algorithm was given by 
Azar and Gamzu lU. In fact, our general result is enabled by a simpler analysis of [1]. 

We also consider the latency covering Steiner tree problem (LCST), which is an interesting special 
case of MLSC. Here, each function fi is associated with a group gi C V and requirement ki such that 

fi{S) = min l^^j-^, l|- This is a natural generalization of the latency group Steiner tree |[l9][8l and 

generalized min-sum set cover ||2][3]|25l problems. We obtain an 0(log'^ |V^|) -approximation algorithm 
for LCST. This relies on a new and stronger LP relaxation for covering Steiner tree, that might be of 
independent interest. We note that any significant improvement over our results would also improve the 
best-known approximation for the well-studied group Steiner tree problem. 

Finally, in a somewhat different direction, we consider the stochastic submodular ranking problem, 
and obtain an 0(log l/e)-approximation algorithm, which is best possible. This problem models sit- 
uations where web pages can be adaptively presented using user feedback. Our result also generalizes 
several previously studied stochastic problems, eg. Iil7.i26.,24j . 

1 Introduction 

Ordering a set of elements so as to be simultaneously good for several valuations is an important issue in 
web-search ranking and broadcast scheduling. A formal model for this was introduced by Azar et al. ||2l 
where they studied the multiple intents re-ranking problem (a.k.a. generalized min-sum set cover [31). 
Subsequently, Azar and Gamzu |[l] studied the submodular ranking problem where the valuations can be 
arbitrary (monotone) submodular functions. 

In this paper, we extend both of these models to the setting of general metric switching costs. This 
allows us to handle additional issues such as: 
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• Data locality: the time taken to read/transmit data j after data i is d{i, j). 

• Context switching: it takes d{i, j) time for a user to parse data j when displayed after data i. 

Thus we study the minimum latency submodular cover problem (MLSC) which is the metric version of 
submodular ranking ll], and the latency covering Steiner tree problem (LCST) which extends generalized 
min-sum set cover We obtain poly-logaiithmic approximation guarantees for both problems. We 

remark that due to a relation to the well-known group Steiner tree |[T6l problem, any significant improvement 
on our results would lead to a similar improvement for group Steiner tree. 

In a somewhat different direction, we also study the stochastic submodular ranking problem, where 
the goal is to adaptively display stochastic elements so as to minimize the expected total cover time. We 
obtain an 0(log i) -approximation algorithm for this problem, which is known to be best possible even in 
the deterministic setting [ll. 

1.1 Problem Definition 

We let V denote the ground set of elements/vertices and d : M+ the distance function. We assume 

that d is symmetric and satisfies triangle inequality. In addition there is a specified root vertex r e F. There 
are m monotone submodular functions /i, ...,/„: 2^ —)• representing the valuations of different users. 
We assume WLOG by normalization that fi{V) = 1 for all i € [m]. Function fi is said to be covered (or 
satisfied) by set 5 C y if fi{S) = 1 = fi{V). The cover time of function in a path vr is the length of the 
shortest prefix of vr that has fi value one, i.e. 

mill t : fi {{v V : V appears within distance t on vr}) = 1. 

The objective in the minimum latency submodular cover problem (MLSC) is to compute a path originating 
at r that minimizes the sum of cover times of all functions. We recover the submodular ranking problem HI 
as the special case when metric d is uniform. A technical parameter that we use to measure performance 
(this was also used in [H) is e which is defined to be the smallest non-zero marginal increase of any function 

{fi}T=i- 

As shown in Azai^ and Gamzu ijTj, the submodular ranking problem contains set-cover as a special 
case (even when m = 1). Similarly, we observe here that MLSC generalizes the group Steiner tree 
problem lfT6ll . where given metric {V,d) with root r and N groups of vertices {gi C V}^-^ the goal is 
find a minimum length tree containing r and at least one vertex from each of the N groups. Note that 
fi{S) = jr E£imin{!£/i n 5|, 1} is a submodular function, with f{S') = 1 iff S'fl^i / for all 
i € [A^]. So MLSC on metric {V,d), root r and one function fi is equivalent to the con^esponding group 
Steiner tree problem (up to factor 2 due to converting a tree into a path). Hence group Steiner tree is a special 
case of MLSC with m = 1 and e = l/N. Similarly, covering Steiner tree, a generalization of group Steiner 
tree can be shown as a special case of MLSC. 

A useful subroutine in our algorithm for MLSC is the submodular orienteering problem (SOP) llT2l 
which is the following. The input is metric (F, d), root r, monotone submodular function / : 2^ ^ R+ and 
length bound B. The goal is to find a path originating at r having length at most B that maximizes f{S), 
where S C y is the set of vertices visited in the path. 

The latency covering Steiner tree problem (LCST) is a natural special case of MLSC, where each 

function fi is associated with a group gi C V and requirement ki such that fi{S) = min | l|- Note 

that in this case, e = 1/ max^^^ ki. The uniform metric special case of LCST reduces to the generalized 
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min-sum set cover problem IHO. When max^^ fcj = 1 in LCST, we obtain the latency group Steiner 
problem, studied in Gupta et al. |IT9l (see also |[27l ) and Chakrabarty-Swamy lH. 
Figure [T] shows the relations between the various problems discussed here. 

MLSC 

j» MLSC: minimum latency submodular cover 

LCST : latency covering Steiner tree 

CST : covering Steiner tree 
GST : group Steiner tree 

SR : submodular ranking 
LGST : latency group Steiner tree 

GMSSC : generalized min-sum set cover 
(multiple intents re-ranking) 

SC : set cover 
MSSC : niin sum set cover 

Figure 1: The various problems and their relations. An anow from X to y means X is a special case of Y. 

Stochastic submodular ranking. Here we are given a set V of stochastic elements, each having an inde- 
pendent distribution over certain domain A. The submodular functions are also defined on groundset A, i.e. 
fi, fm '■ 2^ [0, 1]- In addition, each element i £ V has a deterministic cost/time ii to be scheduled^ 
The realization (from A) of any element is known immediately after scheduling/displaying it. The goal is to 
find an adaptive ordering of V that minimizes the total expected cover time. Since elements are stochastic, 
it is possible that a function is never covered: in such cases we just fix the cover time to be J2i£V ^* (which 
is the total duration of any schedule). 

We will be concerned with adaptive algorithms. Such an algorithm is allowed to decide the next element 
to display based on the instantiations of the previously scheduled elements. This models the setting where 
the algorithm can benefit from user feedback. Our stochastic problem clearly generalizes the stochastic 
set cover studied in ifTTl . Interestingly, it also captures some variants of stochastic set cover that have 
applications in processing multiple queries with probabilistic information Il26ll24l . Various applications of 
stochastic submodular ranking are discussed in more detail in Section |7] 

1.2 Our Results and Techniques 

We start with the general MLSC problem, for which we obtain a poly-logarithmic approximation. 

Theorem 1.1. For any constant 6 > 0, there is an 0(log ^ • log^"'"'^ \V\) -approximation algorithm for the 
minimum latency submodular cover problem. 

'We remark that one could also consider a stochastic generalization of MLSC, where the costs form a metric; however our 
algorithms currently do not apply in this generality. 




LCST 



LGST 



MSSC 
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Note that in the special case of group Steiner tree, this result is only factor 0(log'' \ V\) lai^ger than its best 
known approximation ratio of 0(log N log^ due to Garg et al. |[T6l . In fact our algorithm for MLSC 
is essentially a reduction to submodular orienteering (SOP). Specifically, we show that an (a, /3)-bicriteria 
approximation algorithnU for SOP can be used to obtain an 0{a {3 ■ log ^) -approximation algorithm for 
MLSC. To obtain Theorem ll.il we use an (0(1), 0(log^~'''' |y |))-bicriteria approximation for SOP that 
follows from miHl. Alternatively, we can use the quasi-polynomial time true 0(log |y |)-approximation 
for SOP by Chekuri and Pal llT2l within our algorithm to obtain: 

Theorem 1.2. There is a quasi-polynomial time 0(log 7 -log \ V\)-approximation algorithm for the minimum 
latency submodular cover problem. 

Our algorithm for MLSC is an extension of the elegant "adaptive residual updates scheme" of Azar and 
Gamzu Q for submodular ranking (i.e. uniform metric MLSC). As shown in [HI, an interesting aspect of 
this problem is that the natural greedy algorithm, based on absolute contribution of elements, performs very 
poorly. Instead they used a modified greedy algorithm that selects one element at a time according to residual 
coverage. In the MLSC setting of general metrics, our algorithm uses a similar residual coverage function to 
repeatedly augment the solution. However our augmentations are paths of geometrically increasing lengths, 
instead of just one element. A crucial point in our algorithm is that the residual coverage functions are 
always submodular, and hence we can use submodular orienteering (SOP) in the augmentation step. We 
remark that the approach of covering the maximum number of functions within geometrically increasing 
lengths fails because the residual coverage function here is non-submodular; in fact as noted in [3] this 
subproblem contains the difficult dense-A;-subgraph problem (even for generalized min-sum set cover with 
requirement two). We also note that the choice of our (submodular) residual coverage function ultimately 
draws on the submodular ranking algorithm HI. 

The analysis in [H was based on viewing the optimal and approximate solutions as histograms. This 
approach was first used in this line of work by Feige et al. lITSl for the min-sum set cover problem (see 
also H). This was also the main framework of analysis in 111 for generalized min-sum set cover and 
then for submodular ranking [H. However, these proofs have been increasingly difficult as the problem in 
consideration adds more generality. Instead we follow a different and more direct approach that is similar 
to analysis for the minimum latency TSP, see eg. |[T0l[T3l . In Section [3] we first give a simpler proof of the 
submodular ranking result HI, which also implies the result in Azar et al. Q. Then we present our algorithm 
for MLSC in SectionlH which yields Theorems ll.ll andl l.2l 

Our second main result is for the LCST problem: 
Theorem 1.3. There is an 0(log^ \V\) -approximation algorithm for latency covering Steiner tree. 

Note that our result for the more general MLSC problem implies directly for LCST (i) polynomial time 
0(log kmax'^og'^'^^ |y|)-approximation and (ii) quasi-polynomial time 0(log femax'log |y|)-approximation. 
Here kmax denotes the maximum requirement of any group in LCST. Theorem 11.31 improves on these 
bounds by providing a double logarithmic approximation in polynomial time. As shown in [27], any im- 
provement over Theorem 1 1 . 3 1 even in the kmax = 1 special case (i.e. latency group Steiner tree ) would yield 
an improved approximation ratio for group Steiner tree, which is a long-standing open question. 

Recall that LCST on a uniform metric is the generalized min-sum set cover problem, for which a con- 
stant factor approximation is known due to Bansal et al. 131; the previous greedy-type algorithm lH had a 

^Note that an {a, j3) bicriteria guarantee for SOP means that on every instance, the algorithm produces a path of length at most 
P times the bound and profit at least i times the optimum (that strictly satisfies the length bound). 
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logarithmic ratio. A natural attempt at solving LCST is to extend the algorithm in |[3l to general metrics. 
However, this is not straightforward since the lO algorithm is LP-based, and there is no good LP relaxation 
known for the underlying covering Steiner tree problem. Although the 0(log log^ |y |)-approximation 
algorithm for covering Steiner tree Il20ll23l uses an LP relaxation, that LP is known to have a large integral- 
ity gap 1231 . The algorithms 1201 |23l get around this integrahty gap by iteratively (and adaptively) solving 
a sequence of such LPs; thus the final solution cost cannot be compared against a fixed LP relaxation. In 
context of the LCST problem it is unclear how to use this approach to get an 0(log^ |y D-approximationJl 
We overcome this issue by proposing a new LP relaxation for covering Steiner tree, that has a small (poly- 
logarithmic) integrality gap. The new LP makes use of Knapsack Cover (KC) type inequalities Q. We 
note that KC-inequalities (for set multicover) were also used in f3]- Still, it is non-trivial to formulate these 
stronger inequalities for covering Steiner tree in a way that they are useful in rounding. In particular, the 
separation oracle for the resulting KC-inequalities is more complicated than for set multicover (which was 
the knapsack problem). Once we have the stronger LP relaxation for covering Steiner tree, we can use it in 
an LP-based algorithm for LCST using fairly standai^d techniques. These results are presented in Section [5] 
and Section [6l respectively. 

Our final result is on the stochastic submodular ranking problem. As shown in ifTTlfTSl . even special 
cases of this problem have polynomially large adaptivity gap (ratio between the optimal non-adaptive and 
adaptive solutions). This motivates us to focus on adaptive algorithms, and we obtain the following result in 
Section |7] 

Theorem 1.4. There is an adaptive 0(log ^) -approximation algorithm for the stochastic submodular rank- 
ing problem. 

In particular-, we show that the most natural stochastic extension of the algorithm from iH achieves the 
approximation factor. We remark that the analysis in |[T1 for deterministic submodular ranking assumed unit 
costs, whereas Theorem 1 1 .41 holds for the stochastic setting even with non-uniform costs {£i}. 

As mentioned before, our results generalize the results in ifTTl l26l l24l which study (some variants of) 
stochastic set cover problems. Our analysis is arguably simper and more transparent than |[24l . which gave 
the first tight analysis for these problems. We note that the charging scheme used in |[24l makes use of "dual 
prices" and it does not seem directly applicable to general submodular functions. 

1.3 Previous Work 

The min sum set cover problem was studied in Feige et al. flS] where they proved an approximation ratio of 
4 for the natural greedy algorithm, and also showed a matching hardness of approximation. The algorithmic 
result was also implicit in Bai^-Noy et al. [4J. Azar et al. IS introduced the generalized min-sum set cover 
problem and gave a modified greedy algorithm having an O (log fcmaa) -approximation ratio, where k^ax 
is the maximum requirement of any set. Their analysis built on the histogram based approach from lITSl . 
Later Bansal et al. ||3l gave an improved 0(l)-approximation using LP and knapsack cover inequalities f/l . 
Recently, Skutella and Williamson ll25l improved the approximation ratio to 28 using alpha-point rounding 
methods from scheduling. 

The submodular ranking problem was introduced by Azar and Gamzu [1] where they obtained a tight 
0(log -) -approximation algorithm using a nice residual updates scheme. The analysis was histogram based 
and fairly involved. In contrast, we use a more direct approach that also extends to the general metric setting. 

''The adaptive residual coverage algorithim for MLSC is somewhat similar to adaptively solving LPs for covering Steiner tree; 
but that approach gives us only a weaker bound for LCST. 
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The first poly-logaiithmic approximation for group Steiner tree was 0(log log^ \ V\) due to Garg et 
al. lfT6ll which is still the best known bound. Chekuri et al. ifTTI gave a combinatorial recursive greedy algo- 
rithm that achieved a slightly weaker approximation ratio (the |[T6l result was LP-based). This combinatorial 
approach was extended in Calinescu and Zelikovsky 161 to the problem of covering any submodular function 
in a metric. We use this algorithm in the submodular orienteering (SOP) subroutine for our MLSC result. 
For SOP a true O(log |F|)-approximation is known due to Chekuri and Pal |[T2l . but running in quasi- 
polynomial time. We note that an Q{log^^^ \ V\) hardness of approximation is known for group Steiner tree 
(even on tree metrics) due to Halperin and Kiauthgamer |[2TI . 

The covering Steiner tree problem was introduced in Konjevod et al. ||23l . which can be viewed as 
the multicover version of group Steiner tree. They gave an 0{log{Nkmax) log^ |y|)-approximation us- 
ing an LP-relaxation. However the LP used in ll23l has a large il(/cmax) integrality gap; they got around 
this issue by iteratively solving a suitable sequence of LPs. They also extended the randomized rounding 
analysis from |[T6l to this context. Later, Gupta and Srinivasan ll20l improved the approximation bound to 
0(log N log^ |y|), removing the dependence on the covering requirements. This algorithm was also based 
on solving a similar sequence of LPs; but the improvement was due to a combination of threshold-rounding 
and randomized rounding. In this paper, we give a stronger LP relaxation for covering Steiner based on 
KC-inequalities, that has an 0(log N log^ \ V\) integrality gap. This might be of some independent interest. 
This new LP is also crucial for our 0(log^ |y|)-approximation algorithm for LCST. 

The latency group Steiner problem was studied in |[T9l l27l and also HI, in different contexts. These 
papers imply an 0(log^ |yj)-approximation algorithm for this problem. 

The adaptive set cover problem introduced in lITTl is clearly a special case of stochastic SR that we 
consider. Goemans and Vondrak IITtI showed a large adaptivity gap for set cover, and a logarithmic ap- 
proximation for a variant with multiplicities. A related problem in context of fast query evaluation was 
studied in Munagala et al. |[26l . where the authors gave a triple logarithmic approximation. This bound was 
improved to the optimal logarithmic ratio by Liu et al. ll24l ; this result was also applicable to adaptive set 
cover. Another related paper is by Golovin and Krause fTSl, where they defined a general property "adaptive 
submodularity" and showed nearly optimal approximation guai^antees for several objectives (max coverage, 
min-cost cover and min-sum cover). The most relevant result in ifTSl to the stochastic SR problem is the 
4-approximation for stochastic min sum set cover. This approach required a. fixed submodular function / 



such that the objective is E 



Et>of(y)-fi^t) 



where irt is the instantiation of elements displayed within 

time t and V denotes the instantiation of all elements. However, as mentioned earlier this is not the case 
even for generalized min-sum set cover with requirements two. 



2 Preliminaries 

Let V denote a ground set of elements. A function / : 2^ — > M is said to be submodular if it satisfies the 
property that for any A,B QV, f{A)+f{B) > f{A[JB)+f{AnB). It is well-known that this is equivalent 
to the following property: For any A C B and anyeeV\B, f{A U {e}) - f{A) > f{B U {e}) - f{B). 

Many interesting objectives can be modeled as submodular functions. For example, in economics, 
the latter property of a submodular fuction can be viewed essentially as the law of diminishing marginal 
utility. Also submodular functions appear often in combinatorial optimization. Such examples include set 
unions, cut functions, matroid rank functions etc. We refer the reader to |[28l for an extensive overview of 
submodular functions and their applications. 

From the definition of submodularity, the following useful inequality can be derived. 
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Proposition 2.1. For any A, S C y and submodular function f, it follows that 

{f{B U {e}) - f{B)) > f{B UA)- f{B) 

A submodular function / is said to be monotone if for any A Q B, f{A) < f{B). Since function / is 
defined over all possible subsets of V, it is a standard assumption that a value oracle for / is available. The 
oracle takes as input a subset S '^V and returns the function value f{S) in constant time. 

Notation: We will use ALG to denote the cost of the algorithm we consider. For notational simplicity, we 
will let OPT denote the optimal solution itself or the optimal cost depending on the context. For any integer 
A; > 1 we let [k] := {1,2,..., k}. For notational convenience, we will allow the same notation to denote a 
tree or the set of edges in the tree. 



3 Simpler Analysis of the Submodular Ranking Problem 

In this section, we revisit the submodular ranking problem |[T1. Recall that the input consists of a ground set 
V := [n] of elements and monotone submodular functions /i, /2, ■■■fm '■ 2^"' — >■ [0, 1] with fi{V) = 1, Vz G 
[m]. The goal is to find a complete linear ordering of the elements that minimizes the total cover time of all 
functions. The cover time cov(/j) of fi is defined as the smallest index t such that the function fi has value 
1 on the first t elements in the ordering. We also say that an element e is scheduled at time t if it is the t^^ 
element in the ordering. It is assumed that each function fi satisfies the following property: for any S 5 S', 
if fi{S) — fi{S') > then it must be the case that fi{S) — fi{S') > e, where e > is a constant that is 
uniform for all functions /j. This is a useful parameter in describing the performance guarantee. 

Azar and Gamzu d gave a modified greedy-style algorithm with an approximation factor of 0(log -) 
for submodular ranking. Their analysis was histogram-based and fairly involved. In this section, we give an 
alternate proof of their result. We believe that our analysis is easier and perhaps more intuitive. Our analysis 
also extends to the more general MLSC problem which we will study in the next section. The algorithm 
ALG-AG from [Jj is given below. In the output, 7r(t) denotes the element that appears in the tth time slot. 

Algorithm 1 ALG-AG 

INPUT: Ground set [n]; monotone submodular functions fi : 2["1 ^ [0, 1], i G [m] 



5^0 

for t = 1 to 7i do 

T pf f5/„A ._ V fi{SU{e})-,niS) 
•St 



e = argmaxee[„]\5 / (e) 
S^S[j{e} 
TT{t) ^ e 
end for 

OUTPUT: A linear- ordering (7r(l), 7r(2), . . . , 7r(n)) of [n]. 



Theorem 3.1 (01). The algorithm ALG-AG is an 0{ln(j))-approximation for submodular ranking. 

Let a := 1 + ln{-). To simplify notation, WLOG, we assume that a is an integer. Let R{t) denote 
the set of functions that are not satisfied by ALG-AG earlier than time t; R{t) includes the functions that 
are satisfied exactly at time t. For notational convenience, we will use i G R{t) interchangeably with 
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fi S R{t). Analogous set R*{t) is defined for tlie optimal solution. Note that ALG = J2t>i 1^(01 ^^'^ 
OPT := J2t>i will interested in the number of unsatisfied functions at times {802-^ : j G 

Z+} by ALG-AG and the number of unsatisfied functions at times {2^ : j G Z+} by the optimal solution. 
Let Rj := R{8a2^) and R* = R*{2^) . It is important to note that Rj and R* are concerned with different 
times. For notational simplicity, we let := 0. 

We will show the following key lemma. Roughly speaking, it says that the number of unsatisfied func- 
tions by ALG-AG diminishes quickly unless it is comparable to the number of unsatisfied functions in OPT. 

Lemma 3.2. For any j > 0, we have \Rj\ < l\Rj^i \ + \R*\. 

Proof of Theorem |33J Assuming Lemma [3^ we proceed as follows. 



ALG = ^ E i^wi + E i^wi 

j>0 8a2J<t<8Q23 + i l<t<8a 

< ^ 8a(2^+i -2^)|i?j| + 8aOPT 

j>0 

[Since \R{t)\ is non-increasing, and for any t > 1, < m < OPT] 

= 8a^2^+i(|i?,|-i|i?,_i|) + SaOPT 

< 8a^ 2J+i|i?*| + SaOPT [By Lemma[321 

< 32a^ \R*\ + l6a\R*o\ + 8aOPT 

j>l 2J-i<t<2j 

< 32q;OPT + 24q;OPT [Since is non-increasing] 



Thus we obtain ALG < 56a OPT, which proves Theorem 13. II 

Henceforth, we will focus on proving Lemma [3^ Consider any time step t G [802-^"^, 802-^) for an 
integer j > 1; when j = the lemma trivially holds. Let St^i denote the set of elements that ALG-AG 
displays before time t and let et denote the element that ALG-AG schedules exactly at time t. Let Ej denote 
the set of elements that ALG-AG schedules until time 8a2^ . Let Ej denote the set of elements that OPT 
schedules until time 2^ . Recall that ALG-AG picks et as an element e that maximizes 



^ 1 - fi{St-x) 



i(i{m\.h{St-i)<\ 

This leads us to the following proposition. 
Proposition 3.3. Consider any j > 1 and any time step t £ [8a2-'~^, 8a2-'). Then for any e G E* , we have 

Proof. Since ALG-AG has chosen to schedule element et over all elements e £ Ej \ St-i, we know that 
the claimed inequality holds for any e G E* \ St~i. Further, the inequality holds for any element e in St-i, 
since /'^'^^ (e) = for such an element e. □ 
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By taking an average over all elements in E*, we derive 



> 



J_ V V USt-i^{e})-MSt-,) 

\E*\ ^ ^ l-fi(St-i) ^ ' 



Observe that in the second inequality, we can drop the constraint fi{St^i) < 1 in the subscript, since 
St-i does not cover fi for any i G Rj-. by definition, is not covered before time 8a2^ and t < 8a2^ . 
Due to submodularity of each function fi (more precisely by applying Proposition 12. 1 1 with A := E* and 
B := St-i), we have that 



_ 1 ^ MSt^iUE*)- fASt-i] 



IK* I - IK* I 



The equality is due to the fact that for any i ^ R*-, fi{E*) = 1 and each function fi is monotone. Hence: 

E f''-'i^t) > ^"^^'^,^'~'\ \R,\-\R*\) = M\R,\-\R^\), (2) 

8o-2i-l<t<8a-2J J 

where we used \Ej\ = 2K We now upper bound the left-hand-side of Q. To this end, we need the following 
claim from ifTl . 

Claim 3.4 (Claim 2.3 in fH). Given a monotone function f : 2^"^^ — > [0, 1] and sets = 5*0 ^ 5*1 C • • • C 

'S'^ ^ [n], we have (using the convention 0/0 = Oj 



E^ 



f{S,)-f{S,^,) < ^^1^1^ 



..1 - /(^^-l) - ^ 

Here 6 > is such that for any ACB, if f{B) - f{A) > then f{B) - f{A) > 6. 

Proof. We give a proof for completeness. Order the values in the set {f{Sk) \ < k < £} \ {1} in 
increasing order to obtain /3o < /3i < • • • < Ph- By the assumption, we have /3o = 0, 1 — fSn > 6 and 
Ph — > 5, V/i G [H]. Note that the desired inequality is reduced to: 



yt±^^ < in. 



If f{Si) = 1 then the summation we want to bound has an additional term of = 1- 
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Knowing that the function x is increasing for x S [0, 1), we derive 



ln(l-/3H) < In- 



1 
5 



dx 



This proves the claim. □ 

Note that any function fi not in Rj-i does not contribute to the left-hand-side of This is because 
such a function fi was akeady covered before time 8a 2^~^ < t. Further, knowing by Claim [34] that each 
function /j € can add at most a := 1 + ln i, we can upper bound the left-hand-side of ([2]) by a|i?j_i|. 
More formally, 

fSt-u \ \^ \^ fi{St-i U {et}) - fijSt^i) 

8a-23-i<t<8a-2J 8a-2J-i <t<8a-2i ieRj-i:f,{St-i)<l 

< E E 1-Ms,.,) "i^^-^i 

From (O and (O we obtain 

4a(|i?j| - \R*\) < a\Rj^i\ 
which simplifies to the desired inequality in Lemma [l!2l 



4 Minimum Latency Submodular Cover 

Recall that in the minimum latency submodular cover problem (MLSC), we are given a metric {V, d) with 
root r € y and m monotone submodulai^ functions /i, /2, /m : 2^ — )• [0, 1]. Each function fi satisfies 
the following properties: (1) fi{V) = 1 and (2) for any S' D S, if fi{S') - fi{S) > 0, then it must be the 
case that fi{S') — fi{S) > e, where e > is a value that is uniform for all functions. WLOG, we assume 
that d{-,-) is an integer. The goal is to find a path starting at r that minimizes the total cover time of all 
functions. 

As mentioned earlier, our algorithm for MLSC uses as sub-routine an algorithm for the Submodular 
Orienteering Problem (SOP). Here given metric {V, d), root r, monotone submodular function g : 2^ ^ 
]R_i_ and bound B, the goal is to compute a path P originating at r having length at most B that maximizes 
g{V{P)) where V{P) denotes the set of vertices covered by P. We assume a (p, cj)-bicriteria approximation 
algorithm ALG-SOP for SOP, i.e. on any SOP instance returns a path P of length at most a ■ B and 
g{V{P)) > OPT/p. We recall the following known results on SOP. 

Theorem 4.1 (111). For any constant 6 > there is a polynomial time (0(1), 0(log^''''^ I^D) bicriteria 
approximation algorithm for SOP. 

Theorem 4.2 ([12]). There is a quasi-polynomial time 0(log \V\) approximation algorithm for SOP. 
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We now describe our algorithm ALG-LSCP for MLSC that uses the (p, a) bicriteria approximation 
algorithm ALG-SOP. Here a = 1 + In ^. Note the difference from the submodular ranking algorithm 01: 
here each augmentation is a path possibly covering several vertices. Despite the similarity of ALG-LSCP 
to the min-latency TSP type algorithms lITOl [T3l an important difference is that we do not try to directly 
maximize the number of covered functions in each augmentation: as noted before this subproblem is at least 
as hard as dense-fc-subgraph, for which the best approximation ratio known is only polynomial lH. Instead 
we maximize in each step some proxy residual coverage function that suffices to eventually cover all 
functions quickly. This function is a natural extension of the single-element coverage values used in ALG- 
AG m. It is important to note that in Line (4), f^{-) is defined adaptively based on the cuiTcnt set S of 
visited vertices in each iteration. Moreover, since each function fi monotone submodular, so is f'^ for any 
S C V. In Line (6), vr • P implies the concatenation of vr and P. 

Algorithm 2 ALG-LSCP 

INPUT: {V,d),r € V;{n : 2^ ^ [0,l]}-i. 

1: 5^0, 7r^0. 

2: for fc = 0,1,2,... do 

3: Repeat the following Aap times. 

4: Define submodular function /^(T) := EieHJU^Xi ^^''^i-Msf^^ ^ T QV. 

5: Use ALG-SOP to find a path P of length at most a ■ 2^ starting from r that maximizes f^{V{P)) 

(within a factor of p) where V{P) is the set of nodes visited by P. 
6: S ^ SU V{P) and vr ^ vr ■ P. 
7: end for 

OUTPUT: Output solution vr. 



We prove the following theorem, which implies Theorem 11.11 and [L2l 

Theorem 4.3. ALG-LSCP is an 0{apa)-approximation algorithm for MLSC. 

We now analyze ALG-LSCP. We will say that the algorithm is in the jth phase, when the variable k of 
the for loop in Line (2) has value j. Observe that the final solution visits all vertices that are added in the jth 
phase within time 16ap2'' . This can be easily shown as follows: the final solution is a concatenation of the 
paths that were found in Line (5). Since all these paths are stitched at the root r, the length of tt at the end 
of phase j is at most Ylk=i ^ " ^'^P ' '^^'^ — ^^Oipa ■ 2K The following proposition easily follows. 

Proposition 4.4. Any vertex v added to S in the jth phase is visited by vr within 16apa ■ 2^. 

Let R{t) denote the set of (indices of) the functions that are not covered by ALG-LSCP earlier than 
time t; R{t) includes the functions that are covered exactly at time t as well. We will interchangeably use 
i G R{t) from fi G R{t) for notational simplicity. Let Rj := Rj^Wapa 2^). Similarly, we let R*{t) denote 
the set of functions that are not covered by OPT earUer than time t and let R* = R*{2^). For notational 
convenience, we let := 0. 

We will show the following key lemma. It shows that the number of uncovered functions by ALG-LSCP 
must decrease fast as j grows, unless the corresponding number by the optimal solution is comparable. 

Lemma 4.5. Consider any j > 0. Then we have \Rj\ < j\Rj^i\ + \Rj\- 

Assuming the above lemma holds, we can prove Theorem 14.31 exactly as we proved Theorem 13. ll in the 
previous section using Lemma [l!2l Thus it suffices to prove Lemma 1431 which we do next. 
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The lemma trivially holds when j = 0, hence consider any fixed phase j > 1. Let 5*0 denote the set of 
vertices that were add to S up to the end of phase j — I. Let H = Aap and Ti,T2, Th be the sets of 
vertices that were added in Line (6) in the jth phase. Let ^/^ = 5o U Ti U T2 U ... U T/j, VI < /i < H. We 
will prove Lemma l43] by lower and upper bounding the quantity 

A . fSh-i(rr ^ fij^h) - fi{Sh-l) 

A, .= ^/ in) = ^ ^ 1 - /,(5._i) 

h=\ h=l ie[m]:f,iSn-i)<l ^ ' 

We first lower bound Aj. Let T* denote the set of vertices that OPT visited within time 2^. Observe 
that in Line (5), ALG-LSCP could have visited all nodes in T* by choosing P as the prefix of length 2^ of 
OPT. Via the approximation guarantee of ALG-SOP, we obtain 

Proposition 4.6. For any h G [H] audi € Rj, we have /^'^-^(S'h) > i • /^^-i(r*). 

For the remaining proof, we only give a sketch, since it is very similar to the proof of Lemma |3^ We 
restrict our concern to the functions in Rj \ R*. Observe that for any i € Rj and h € [H], fi{Sh-i) < 1 
and that for any i ^ R*, fi{T*) = 1. Hence by summing the inequality in the above proposition over all 
functions in Rj \R*,we have 

^ h=i ieRj\R* ■'^^ ^' ^ 

We now upper bound Aj. Note by definition that for any i ^ Rj-i, fi{So) = 1 and therefore fi does 
not contribute to Aj. For any i G Rj-i, the total contribution of fi to Aj is at most a by Claim [33] Hence 
it follows that 

Aj<a|i?j_i| (5) 
Combining ^ and dS]) completes the proof of Lemma 1431 



5 New LP Relaxation for Covering Steiner Tree 

We revisit the covering Steiner tree problem (CST) Il20ll23l . Recall that its input consists of a metric {V, d) 
with root r and a collection of groups C 2^ where each group 51 G is associated with a requirement 
kg. The goal is to find a minimum cost r-rooted tree r that includes r and covers all groups. A group g is 
covered if r contains at least kg nodes from g. 

Konjeved et al. Il23l gave an 0((log m+log femax) -log gmax-^og ?i)-approximation where m is the num- 
ber of groups, kmax '■= maXggg kg is the maximum requirement and Qmax = max^gg \g\ is the maximum 
group size. Later, Gupta and Srinivasan 1201 gave an improved 0(logm • log gmax • log n)-approximation 
that has no dependence on kmax- In certain settings of parameters, the result |[20l yields an improvement 
of Q( io'°io"n ) ^"^^^ II23I . Both these previous works use the same LP that has a large integrality gap of 
^{kmax) Il23l . To circumvent this, they solved a sequence of LPs adaptively: In each step, the requirement 
of each group is partially satisfied (by a constant fraction), and the subsequent LP is defined over the residual 
requirement and the unvisited vertices. 

We introduce stronger constraints, that yield an LP for CST with integrality gap 0(logm • log gmax ■ 
logn). This new LP is an important ingredient in our algorithm for LCST (next section), and might be 
useful in other contexts as well. 
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5.1 Simplifying assumptions 

Following |[23ll20l . WLOG, we will proceed our analysis under the following simplifying assumptions. 

• The given graph is a (weighted) tree T = {V,E) with root r and weight We on each edge e ^ E. 

• The groups in Q are disjoint. 

• Every vertex in a group is a leaf, i.e. has degree 1. 

• Every vertex of degree 1 is in a group. 

We note that by the tree embedding in |[T4l . a ^-approximation on tree instances will immediately imply 
an 0(plog |y |)-approximation on general instances. Hence we focus on giving an 0(log?n • log (^max)" 
approximation on tree instances. 



5.2 Linear Program 

We let L denote the set of leaves in V. Because of the above simplifying assumptions, we can label each 
vertex v in a. group with a unique leaf-edge incident on it, and vice versa. We abuse notation by allowing 
j E L to denote both the leaf-vertex and its unique incident edge. For any subset of leaves L' C L, let 
cut(r, L') denote the family of edge-cuts that separate the root r from all vertices in L'. For notational 
convenience, we assume that there is a pseudo-edge incident on r that is an ancestor of all edges in E. We 
call this edge super-root and assume that it is alway included in our solution and has zero cost. Hence for 
any edge j € E, its parent edge, denoted as pe{j), is well-defined. 

We formulate the following linear programming relaxation for CST on tree instances. 



min ^WeXe (LPcst) 

e£E 

subject to: Xpe(e) ^ Xe Ve G (6) 

{kg - \A\) ^ a;j + J2 - 1^1 ^5 e ^> C g, 

jeB\L j(iBn{L\A) 

yB € cut{r,g\A) (7) 
Xe G [0, 1] yee E 

We first argue that this is a valid relaxation. Consider any instance of CST on trees and a fixed feasible 
solution (tree) r*, which gives a natural integral solution: Xe = 1 if and only if e G r*. We focus on 
constraints Q, since the other constraints are obviously satisfied. Consider any g ^ Q, A g and B G 
cut(r,5r \ A). Let t*{E \ A) denote the subtree r* \ Note that t*{E \ A) is connected, since 
A consists only of leaf edges. Since r* has at least kg edges in g (it is a feasible CST solution), \t*{E \ 
A)f]{g\A)\>kg-\A\. 

• Suppose that there exists j £ t*{E\ A) f] B such that j ^ L. Then since j £ B \ L, it follows that 
(A;^ — j^l) Ylj^B\L — ~ 1^1' hence the constraint is satisfied. 

• The remaining case has t* {E \ A) n B C L. In words, B cuts g\A from r using only leaf edges; so 
B^T*{E\A) f]{g \ A). Thus E,6Bn(AA) > \r*iE \ A) f]{g \A)\>kg- \A\. 

In either of the above cases, constraint ^ is satisfied. 
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5.3 Solving the LP 



Since LPcst has exponentially many constraints, in order to solve it in polynomial time, we need a separa- 
tion oracle. Again we will focus on constraints (O, since other constraints are only polynomially many. We 
observe such an oracle is reduced to finding a polynomial time algorithm for the following problem. 

Problem MinCutWith Exceptions: In this problem, we are given as input a tree T rooted at r with leaves 
L and cost £{e) on each edge e. Further, an integer > is given. The goal is to find a minimum cost cut 
that separates r from any D leaves. 

We first explain how the desired separation oracle is reduced to solving the above problem. Consider 
the constraints ^ for a fixed g ^ Q and all A C with |^| = r/ (finally we will iterate over all G C/ and 
< ry < n). Then kg — \A\ (the right-hand-side of the constraints) is also fixed. Given Xj values, we would 
like to find A g with |^| = rj and B G cut(r, (7 \ A) that minimizes the left-hand-side, and test if this 
is smaller than kg — t]. Formally, we can recast this into MinCutWith Exceptions as follows: Remove all 
edges from E that are not on any path from the root r to a vertex in g, and let T' be the resulting tree and 
this is the input tree to the problem. Note that leaves of T' are precisely g. For all leaf-edges j G g, let 
:= Xj', and for all non-leaf j £ T' \ g, £{j) := {kg — rj) ■ xj. Also set bound D := rj — \g\. 

We next show that IVIinCutWitllExceptions can be solved via a dynamic programming. 

Lemma 5.1. The problem MinCutWithExceptions can be solved in polynomial time. 

Proof. To formally describe our dynamic programming, we make some simplifying assumptions. By intro- 
ducing dummy edges of infinite cost, we assume WLOG that the tree T is binary and the root r is incident 
to exactly one edge e^.. Hence every non-leaf edge e has exactly two child-edges ei and 62. For any edge 
e G T let Tg denote the subtree of T rooted at e, i.e. Tg contains edge e and all its descendants. 

We define a recurrence for C[e, A;] which denotes the minimum cost cut that separates the root of Tg 
from exactly k leaves in Tg. Note that C[er, D] gives the optimal value. 

For any leaf-edge / set: 

if /fc = 0, 
C[f,k] = { nf) ifk = l, and 
00 otherwise. 



For any non-leaf edge e with children ei and 62, set: 

C[e,k] = < 

It can be checked directly that this recurrence computes the desired values. □ 



if A; = 0, 

min{C[ei, ki] + C[e2, k2] : ki + k2 = k} if 1 < A; < |L n Tg], 

min{£(e), min{C[ei, A;i] + C[e2, A;2] : ki + k2 = k}} ifA; = |Lnrg|, and 

00 otherwise. 



5.4 Algorithm 

Our algorithm uses a threshold rounding and then applies the elegant rounding scheme from ll23l (to a 
suitably modified fractional solution). In this respect it is somewhat similar to the algorithm in |l20l; however 
that approach solves LPs and rounds them iteratively. Whereas our algorithm uses a single LP. We describe 
the KRS rounding scheme in Section [5. 4. H and our main algorithm in Section [5. 4.21 
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5.4.1 The rounding scheme in ||23]| 



Algorithm 3 ALG-KRS lH 

INPUT: Undirected tree T = {V, E) rooted at r; Ze € [0, 1], Ve e such that for all e & E, Zp^f^^) > Ze- 
1: 5^0. 

2: For each e ^ E such that Ze > 0, add e to 5 with probability — 
OUTPUT: The connected component (tree) including r in S. 



The following proposition easily follows. 

Proposition 5.2 ( II23I ). Each edge e is included in the final solution of ALG-KRS with probability Zg- 

Definition 5.3 (KRS properties). For all g G G, consider any R{g) ^ g and any < rg < \R{g)\. We say 
that {z, R{g),rg) satisfies the KRS properties if it satisfies the following properties. 

Zpe{e) > Ze Ve € (8) 

Zj < rgZe Ve G £■ (9) 

iGT{e)nR(g) 

where T(e) is the subtree below (and including) edge e. 

The first property © is the same as the constraints ([6]). The second property Q is a Lipschitz-type 
condition that says conditional on any edge e being chosen, its subtree T(e) can contribute at most rg to the 
requirement of R{g). We remark that this property ^ was enforced explicitly in Il23ll20l as constraints (for 
specific R{g) and Vg) in their sequence of LPs. On the other hand, as we show later, our LP consti'aint © 
directly gives this property for all relevant R{g) and rg values. 

Lemma 5.4 ( II23I ). Suppose that [z, R{g),rg) satisfies the KRS properties. Let L^rs denote the set of leaves 
that are covered by ALG-KRS with the input Ze,e G E. Consider any constant 5 G [0, 1]. Then for any 
g € G, we have 

6^ -fig 



Pr 



\LkrsnR{g)\ < {1-S)fig 



< exp 



2 + rgil + ln\Rig)\) 



where fig := E[\Lkrs D R{g)\] = J2j^R(g) zj. 

Proof. We only give a quick sketch of the proof, since this is implicit in |[23l . For any G R{g), we say 
that j ~ j' if and only if (1) j / / and (2) the least common ancestor 1 ca(j, j') of j and j' is not r. Define 

i,i'eR(9):j~i',%,a(,.y)>o ^''^^^'^'> 
In Theorem 3.2 in ||23l . Konjevod et al. showed using the KRS properties that 

Ag < i^givg - 1 + rgln\R{g)\) 

We note that the proof of Theorem 3.2 implies this, although it is stated only for fig = Vg. Further, they 
used this bound in Jansen's inequality to obtain for any 5 G [0, 1], 

2 + Ag/f,g, 

Above fig = 'E[\Lkrs n R{g)\] = T^jeRig) ^^i^ ^ ^krs] = I2jeRig) Proposition O Combining the 
above two inequalities yields the lemma. □ 



Pr 



\Lkrsr\R{g)\ < {I -5) fig 



< exp 
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5.4.2 Main algorithm 



The algorithm first preprocesses x* an optimal solution to LPcst to obtain another solution x' that will used 
in the KRS -rounding. Let E* := {e ^ E | x* > |}; all edges in E* will be included in our solution with 
probability 1. Observe that E* forms a tree rooted at r due to the constraints (|7]). We let T' = T / E* denote 
the tree obtained by contracting edges E* in the original tree T; and E' = E\E* the edges of T' . For 
any group g ^ Q, define R{g) := g\ E* and Vg := kg — \g n E*\. Since E* already covers the vertices 
in g n E*, we will henceforth be concerned with getting coverage of at least Vg from vertices R{g). Our 
algorithm ALG-CST is formally described below. Note that for any e G E', z'^ = 2x'^ < 1 and 2pe(e) — ■^e' 
hence z' is a valid input to ALG-KRS with respect to T'. The final solution is the union of all sampled trees 
r plus the edges in E* that are initially shrunk. 

Algorithm 4 ALG-CST 

INPUT: Tree T with edge lengths, root r, groups Q and requirements 
{kg}geg- 

1: Let X*, e G -B be an optimal solution to LPcst- 

2: E* ^{eeE\xl> 1/2}, R{g) ^ g\E* and rg^kg-\gr\ E*\. 

3: Shrink all edges in E* and let T' be the resulting tree with edges E' := E\E*. 

4: Obtain solution x' from x* using the procedure in Lemma 1531 

5: For each e € E', z'^ ^ 2x'^\ note ^ G [0, 1]. 

6: 5^0. 

7: Repeat the following 6* (log m ■ log gmax) times: 

8: Add to S the tree r produced by ALG-KRS with Ze = 4 for ti-ee T' . 
OUTPUT: Combine all trees in S and E* and output the resulting tree. 



Lemma 5.5. There exist x'^ G [0, x*]/or all e & E\E* such that for all g G, {x' , R{g), Vg) satisfies the 
KRS properties in tree T' and X^jg^(g) x'^ > Vg. Furthermore, such x' can he computed in polynomial time. 

Proof. Consider constraints ([7]) of LPcst- Fix a group g ^ Q and let A := gr\ E*. Consider tree T' as a 
flow network with each leaf edge / having capacity x*j and each non-leaf edge e having capacity Vg ■ x*^. 
The root r is the source and leaves R{g) = g \ A are tiie sinks. Then constraints Q imply that the min cut 
separating r from R{g) has value at least r^: note that although these constraints are for the original tree T, 
they imply similar constraints for T' since T' is obtained from T by edge-contraction0 Hence there must 
exist a max-flow of volume at least rg from r to R{g) in the above network. Let x'^ denote the volume of 
this flow into each leaf edge / G R{g)', clearly we have that x', < x*r (due to capacity on leaves) and: 




(10) 



Moreover, by the capacities on non-leaves. 





ye€E 



(11) 



j€Tie)nR{g) 



''in particular every cut B' separating r from g \ ^ in T' is also a cut separating r from g\Am T. 
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We can use the above procedure on each group g & G separately to compute x'j for all leaf edges /; 
this is well-defined since groups are disjoint. For each non-leaf edge e ^ E' set x'^ := x*. Thus we have 
< Xg < X* for all e € E'. Observe that this computation can easily be done in poly-time. 

Now, (fTTI) implies the second KRS property Property ([8]) follows, since for each e G E', we have 
^pe(e) ~ ^pe(e) — ^ ^e' ^^c first inequality is due to constraint Q of LPcst- Finally (flOl) implies the 
coverage property claimed in the lemma. □ 



5.5 Analysis 

Consider a random tree r in Line (8) of ALG-CST. For a formal analysis, we assume that ALG-CST 
samples r, 41ogm(3 + In gmax) times independently. For each group 5 € ^, we will focus on lower 
bounding the probability that r satisfies the residual requirement rg for R{g). 

Lemma 5.6. For any group g e G, Pr[|r n R{g)\ < Vg] < exp(— 2p:pjj3___^)_ 

Proof. Consider any group g G G- By Lemma 1531 we know (x', R{g), rg) satisfies the KRS properties with 
respect to tree T'. From the definition of = 2 • Xg and that < 1 for all e € E', it immediately follows 
that [z' , R{g),rg) also satisfies the KRS properties on T' . Further, by Proposition 15.21 it follows 

^lg■.= nrr^R{g)W= ^ z;. = 2 x;. > 2r„ 

j&R{g) j€Rig) 

where we used the coverage property X]jei?(g) ^'j — ^9 given by Lemma [5751 By applying Lemma |574l with 
6 = 1/2, we have 



Pr 



""^''"l<'»l-°''K- 2(2 + ,,(lH-'ln|fito)|)) ) 



1 \ / 1 

exp I — — — — - — < exp 



2{3 + In \R{g)\ J - 2(3 + In ^^^^ 

Above we used rg > 1 and < gmax- D 

It follows that upon sampling the tree r independently 41nm • (3 + In gmax) times, each group g ^ G 

is covered with probability at least 1 t-. A simple union bound yields that all groups aie covered by the 

output of ALG-CST with probability at least 1 — 1/m. 

Now we upper bound the expected cost of the final solution. 

Lemma 5.7. With at least a half probability, the final solution of ALG-CST does not cost more than 
8 log m(3 + In gmax) times the optimum. 

Proof. From Proposition 15.21 we know that the expected cost of r is YleeE\E* -^^e — ^ J2eeE\E* — 
2 X^eeBXB* ^e- Hence, knowing that x* > 1/2 for all e G £'*, the expected cost of the final solution is at 
most 41ogm(3 + \ngmax) Y.eeE\E* '^eK + EeGB* ^ 41ogm(3 + lngmax)OP'^- Markov inequality 
yields the lemma. □ 

Hence we derive the following theorem. 

Theorem 5.8 ( li20l ). There is a randomized 0(logm • log gmax)-opproximation algorithm for CSTon tree 
metrics. For general metrics, there is a randomized 0(log m ■ log gmax ■ log \ V\)-approximation algorithm. 
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6 Latency Covering Steiner Tree Problem 



In this section, we give an improved algorithm for the latency covering Steiner tree problem (LCST), which 
is an interesting special case of MLSC. Recall that the input to LCST consists of a symmetric metric {V, d), 
root r € V and a collection Q of m groups with associated requirements {/cgjggg.The goal is find a path 
staring from r that minimizes the total cover time of all groups. We say that group g is covered at the earliest 
time t when the path within distance t visits at least kg vertices in g. We give an O {log gmax ■ log |^|)- 
approximation algorithm for this problem where gmax '■= max^gg is the maximum group size. 

The stronger constraints (|7]l, which we introduced in the previous section, will play a crucial role in 
our algorithm and analysis for LCST as well. We make the same simplifying assumptions that we made in 
Section |5?T] In particular, in order to obtain an 0{loggmax • log |y|)-approximation on general instances, 
it suffices to give an O (log (7max) -approximation on tree instances. We will follow most of the notation and 
will explain any difference in notation if and when necessary. 

6.1 Linear Programming Relaxation 

We formulate the following linear relaxation for tree instances of LCST. 



E(l-2/^) (LPlcst) 

see 

> xi yi>0,e£E (12) 
< 2^ > (13) 

> {k,-\A\)-yl 

ye>0,g eg,ACg, B ecut{r,g\A) (14) 

> 4 ye>0,geg (15) 
G [0, 1] yi>0,e£E 
G [0,1] V£ > 0,g € g 

To see that this is a valid relaxation, let OPT denote the optimal path. For any £ > let 0PT(2^) 
denote the prefix of length 2^ of OPT. We construct a feasible integral solution for OPTas follows. The 
variable indicates if edge e is covered by 0PT(2^). The indicator variable has value one if and only 
if group g is covered by 0PT(2^). Constraints (fT2l ) come from the fact that 0PT(2^) is a path starting at r. 
Constraints ([T3] ) say that the edges in 0PT(2^) have a total weight of at most 2^, which is clearly true. Note 
that for each £ > 0, we created a set of constraints ([T4l ) that are essentially equivalent to the constraints dT]) 
of LPcsT- Constraints (fT4l ) can be shown to be valid similarly as are the constraints Q. Constraints (fTSl ) 
enforce the fact that if group g is covered by 0PT(2^) and it must be covered by 0PT(2^+^) as well. Now 
consider the objective value: the total contribution of a group g that is covered by OPTat time G {2^, 2^^^] 
is i • Yle=o — ^'^^ Thus the objective value of this integral solution is at most OPT. 

It is easy to see that it suffices to consider £ of polynomial value: so the number of variables is polyno- 
mial. Note that constraints ([T4l ) are exponentially many. However, for each fixed i, we can use essentially 
the same separation oracle that we used for the constraints ([7]) of LPcst- 



mill - • 2^ 
2 ^ 

£>0 

subject to: a;J^(^) 

"^Wexi 

-\a\)y: 4 + E 

j&B\L jeBnL\A 
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6.2 Algorithm 

Let {x,y) denote a fixed optimal solution to LPlcst- Let := {e ^ E \ x^^ > 1/4}. Observe that E^ 
forms a tree rooted at r due to the constraints (fT2l) . The edges in E will be added to our solution with 
probability 1 by the ^th phase of our algorithm. Let R^{g) := g\ E^ and = kg — \g Ci E^\. Our algorithm 
will try to satisfy the residual requirement of group g by covering vertices in R^{g) when the group is 
substantially covered (say to extent i) by the optimal fractional solution {x,y). 
For each ^ > we preprocess x^ and obtain x^ that have "good" properties. 

Lemma 6.1. For any £ > 0, we can find in polynomial time x^ S [0, Xg], £ E \ E^ such that \/g € Q: 
1. {x, R^{g),rg) satisfies the KRS-properties in tree T^. 
2- Ejei?*{g)^i - ^g'yg (coverage property). 

Proof. The proof of this lemma is very similar to Lemma [531 For each £, we obtain the desired x^ values 
from x^ by taking similar steps as we did to set x' from x* in the proof of Lemma [531 □ 

We now describe our algorithm ALG-LCST as follows. We will say that ALG-LCST is in phase £ based 
on the value of i in the description. 

Algorithm 5 ALG-LCST 

INPUT: Tree T with edge lengths, root r, groups Q and requirements 
1: vr ^ 0. 

2: Let {x,y) be an optimal solution to LPlcst- 
3: for£ = 0, l,2,...do 

4: E^ ^{eeE\xe> 1/4}, R\g) ^ g\E^ and rg^kg- \gr\E^\. 

5: Shrink all edges in E^ in T and let be the resulting tree with the edge set & := E \ E^ . 

6: Obtain solution x^ from x^ using Lemma [6H 

7: For each e G E^, zf ^ Ax^; note that € [0, 1]. 

8: ^ 0. 

9: Repeat the following 6(3 + log gmax) times: 
10: ^ the tree produced by ALG-KRS with z = on tree 

11: Addr^tOcS^ 

12: Combine all trees in with E and make the resulting tree as a tour starting at r 
13: If has weight at most 192(3 + log gmax) ■ 2^ then 
14: vr ^ TT • P^. 

15: end for 

OUTPUT: The path vr. 



6.3 Analysis 

For any group g, define i{g) as the smallest £ > such that Ijg > 1/2. Then it follows that for any £ > £{g), 
Vg > 1/2 due to constraints ([T51 ) of LPlcst- In words, the optimal fractional solution covers group g by at 
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least half within time 2^(3^. Consider any group g G Q, £ > £{g) and a tree in Line (10) of ALG-LCST. 
Since all edges in E are included in with probability 1 , group g is covered by if and only if at least 
vertices in the residual group R^{g) are covered by r^. This motivates us to derive the following lemma. 

Lemma 6.2. For any g eG and £ > £{g), Pr[|r^ n R{g)\ < Vg] < exp(- ^^3^^^^^^^^-^ ). 

Proof. From Lemma l6. 1 l it follows that {x^, R^{g),rg) satisfies the KRS properties on tree T^. Furthermore, 
using yg>^ and the coverage property in Lemma I67T1 

:= E[\T'nR\g)\] = ^ 4 = E ^ ^ • • > 2r^ 
Here we also used Proposition 15 .21 that Pr[j G r^] = z^. By applying Lemma with 5 = 1/2, we have 



Pr 



\r'r^R{g)\ < 



J" 

^ exp ( - — — — ,,,, ) < exp 



2(2 + r^(l + ln|i?^(5)|)); - 2(3 + ln5^;„,, 

This proves Lemma ld!2l □ 

We now show the following lemma. 

Lemma 6.3. Consider any group g £ Q and £ > £{g). The probability that P^ has a total weight of at most 
192(3 + log gmax) ■ 2^ and covers g is at least 3/4. 



Proof. By Proposition 15.21 we know that each edge e € £' is included in r with probability = 4:X^. 
Since for all e G , > 1/ 4, the expected total weight of the edges in E and is upper bounded by 

^^We+^^We-4:X^ < 4:'^We-xi < 4-2^' 

The last inequality is due to the constraints ([T3] ). Hence the expected cost of is at most 24(3 + log gmax) " 
2^. Markov inequality immediately gives that the total weight of P^ is greater than 192(3 + log^max) ■ 2^ 
with probability at most 1/8. Since is sampled 6(3 + log (7max ) • 2^ times, from Lemma[6]2j we know that 
group g is not covered by P^ with a probability of at most 1/e^ < 1/8. Hence the claim follows. □ 

Fix any group g ^ Q, and £ > £{g). Among P^^d) ^ ^ p^^ consider the paths that are added 

to vr. Clearly the total weight of such paths is at most 0{[oggmax ■ 2^). By Lemma [631 the probability 
that none of these paths covers g is at most ^e-e\g)+i ■ Hence the expected cover time of g is at most 
Y^e>e{g) 0{loggmax) ■ 2^ • ^e~e\g)+i = 0{loggmax ' 2^^9)y Thus the expected total cover time 

ALG-LCST < 0(log5max)-5^2^(f) 

gee; 

By the definition of 2^^^') being ^ -completion time in the LP, we know 

i>o geg geg geg 

These two inequalities show that ALG-LCST is an O (log (jrmax) -approximation on tree instances. By 
the tree embedding due to HU, we conclude that ALG-LCST can be used to yield an 0(Iog gmax ■ log \ V\)- 
approximation for general metrics, thus proving Theorem II .31 
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7 Stochastic Submodular Ranking 



In this section, we study the stochastic submodular ranking problem. The input consists of a set ^ = 
{Xi, ...,Xn} of n independent random variables (stochastic elements, each over domain A) with integer 
sizes (deterministic), and m monotone submodular functions /i, [0, 1] on groundset 

A. We are also given the distribution (over A) of each stochastic element {Xi}f^^. The realization Xj € A 
of random variable Xi is known immediately after scheduling/displaying Xj. Here, Xi requires ii units 
of time to be scheduled; recall that in the non-stochastic submodular ranking, elements were assumed to 
have unit sizes. A feasible solution/policy is an adaptive ordering of A, represented naturally by a decision 
tree with branches corresponding to the realization of the stochastic elements. We use (7r(l), . . . , vr(n)) to 
denote this ordering, where each Tr{l) is a random variable denoting the index of the l^^ scheduled element. 

The cover time cov(/j) of is defined as the earliest time t such that /j has value one on the realization 
of the elements completely scheduled within time t. More formally, cov(/i) is the earliest time t such that 
fi{{^-K{i) ) ■■■jXn{kt)}) — 1 where kt is the maximum integer such that + £^(2) + ••• + ^TT(kt) ^ If the 
function value never reaches one (due to the stochastic nature of elements) then cov(/j) = ^1 + £2 + ••■ + ^n- 
Note that the cover time is a random value. The goal is to find a policy that (approximately) minimizes the 
expected total cover time E J2ie[m] cov(/j) . As in the previous sections, each function /j satisfies the 
following property: for any S 5 S', if fi{S) - fi{S') > then it must be the case that fi{S) - fi{S') > e, 
where e > is a uniform value for all functions /j. 

Our stochastic extension captures many interesting applications. 

Adaptive Set Cover: We are given as input a ground set A, and a collection 5 C 2^ of (non-stochastic) 
sets. There are stochastic elements {Xi : i G [n]}, each defined over A, and having costs. The goal is to 
give an adaptive policy that hits all sets in S using the minimum cost of (stochastic) elements. This problem 
was studied in |[T7ll26ll24l . The problem can be shown as an instance of stochastic SR: There is a single 
monotone submodular function fi{A) := J2seS |^ n 5|} and e = 



Shared Filter Evaluation: This problem was introduced by 11261 . and the result was improved to an essen- 
tially optimal solution in |[24l . In this problem, there is a collection of independent filters Xi, X2, X^, 
each gets evaluated either to True or False. For each filter i € [n], we are given the "selectivity" 
Pi = Pr[Xj is true] and the cost £{ of running the filter. We are also given a collection Q of queries, where 
each query Qj is a conjunction of a subset of queries. We would like to determine each query in Q to 
be True or False by (adaptively) testing filters of the minimum cost. In order to cast this problem as 
stochastic SR, we use A = UiLii**' ^'^^ &&ch i G [71], Xi = ti w.p. pi, and Xi = fi with the remaining 
probability 1 — pi. We create one monotone submodular function: 

/i(A):=^ min/l, \A n {fi : i e Qj}\ + ^ ■ \A D {U : i e Qj}\] 

We note that the Shared Filter Evaluation problem can be studied for a latency type of objective also. In 
this case, for each query Qj € Q, we create a separate submodular function: 

:=min|l, |A n {/, : i G + • |A n {t, : i G | 

In this case, the stochastic SR problem corresponds precisely to filter evaluation that minimizes the average 
time to answer queries in Q. 
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stochastic generalized min-sum set cover: We are given as input a ground set A, and a collection 5 C 2^ 
of (non-stochastic) sets with requirement k{S) for each S G S. There are stochastic elements {Xi : i € [n]}, 
each defined over A. Set S" € 5 is said to be completed when at least k{S) elements from S have been 
displayed. The goal is to find an adaptive ordering of [n] so as to minimize the expected total completion 
time. This can be reduced to stochastic SR by defining function f^{A) := min{l, \A n S\/k{S)} for each 
S £ S; here e = l/kmax where kmax denotes the maximum requirement. 

We consider adaptive policies: this chooses at each time ^^(i) + ^^(2) + ••• + ^7r(fc-i)' the element 
Xn(k) G X^(2), X^(3), ^7r{fc-i)} after observing the realizations ...,x^^k-i)- So it can 

be described as a decision tree. Our main result is an 0(log i) -approximate adaptive policy. This result is 
again inspired by our simpler analysis of IH. 

Remark 7.1. Obviously, the stochastic submodular ranking problem generalizes its deterministic counter- 
part where elements have non-uniform sizes. We note that /fTl/ studied the deterministic submodular ranking 
only when all elements have unit sizes. 

Remark 7.2. For stochastic generalized min-sum set cover ( defined above ), our result implies an O (log k^ax ) - 
approximation to adaptive policies. However for non-adaptive policies, one can obtain a better 0(1)- 
approximation algorithm by combining the Sample Average Approximation (SAA) method l[22\ with 
0(1) -approximations known for the non-stochastic version 0] 1251/ . 

7.1 Algorithm and Analysis 

To formally describe our algorithm, we quickly define the probability spaces we will be concerned with. 
We use = A" to denote the outcome space of A. We use the same notation to denote the probability 
space induced by this outcome space. For any S C -A and its realization s, let Q,{s) denote the outcome 
subspace that conforms to s. We can naturally define the probability space defined by Q,{s) as follows: The 
probability that w E Q,{s) occurs is Prf7[u']/ PrQ[r2(s)]. We also use Q,{s) to denote this probability space. 

The main algorithm is a natural extension of the algorithm by Azar and Gamzu HI, which is given 
below. In the algorithm. Let a := 1 + In(-). In the output, it{1) denotes the Ith element (random variable) 
in A that appeai^s in the output. 

Algorithm 6 ALG-AG-STO 

INPUT: A = {Xi, ...,Xn} with {h, /i : 2^ ^ [0, 1], i € [m]. 
1:5^0. (5 are the elements completly scheduled so far, and s their instantiation.) 
2: while There exists fi such that fi{s) < 1 do 
3: Choose element X^ as follows: 




.ieM./i(s)<i 




fi{sU{X,})-f,{s) 

1 - m 



4: S^S[j{Xe}. 

5: tt{\S\) Xe. Schedule Xe and observe its realization. 
6: end while 
OUTPUT: An adaptive ordering vr of A. 
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Observe that taking expectation over Q{s) in Step [3] is the same as expectation over the distribution 
of Xf, since ^ S and the elements are independent. Also note that this algorithm implicitly defines a 
decision tree. 

Theorem 7.3. ALG-AG-STO is an 0{ln{^))-appwximation algorithm for stochastic submodular ranking. 

To simplify notation, without loss of generality, we assume that a is an integer. Let R{t) denote the 
(random) set of functions that are not satisfied by ALG-AG-STO before time t. Note that the set R{t) 
includes the functions that are satisfied exactly at time t. Analogously, the set R*{t) is defined for the 
optimal decision tree. For notational convenience, we will use i € R{t) interchangeably with /j G R{t). 
Let C{t) := {/i,.. .,/,„} \ R{t) and C*{t) := {/i,...,/^} \ R*{t). Note that all the sets C(-), C*{-), 
R{-), R*{-) are stochastic. We have that ALG = EteM l^^^^' '^^'^ Ete[n] l^*(*)l and hence 
ALG and OPT are stochastic quantities. We will show that EALG = 0{a) • EOPT which suffices to prove 
Theorem 17. 3 1 

We will be interested in the number of unsatisfied functions at times {8a2^ : j G Z+j by ALG-AG-STO 
and the number of unsatisfied functions at times {2^ : j G Z+} by the optimal solution. Let Rj := R{8a2^) 
and R* = R*{2^). It is important to note that Rj and R* are concerned with different times, and they are 
stochastic. For notational simplicity, we let R^i := 0. 

We will show the following key lemma. Roughly speaking, it says that the expected number of unsatis- 
fied functions by ALG-AG diminishes quickly, unless it is comparable to the expected number of unsatisfied 
functions in OPT. Once we get this lemma, we can prove Theorem 17.31 similar to the proof of Theorem l3.1l 

Lemma 7.4. For any j > 0, we have¥.[\Rj\] < lE[\Rj^i\] +E[\R*\]. 

Proof. The lemma trivially holds for j = 0, so we consider any j > I. For any i > 1, we use sj_i to denote 
the set of elements completely scheduled by ALG-AG-STO by time t — 1 along with their instantiations; 
clearly this is a random variable. Also, for t > 1 let a{t) G [n] denote the (random) index of the element 
being scheduled during time slot (t— 1, Note that Sf_i determines a{t) precisely, but not the instantiation 
of ^cr(t) • Let E* Q Ahe. the (stochastic) set of elements that is completely scheduled by the optimal policy 
within time 2-^ . For a certain stochastic set (or elements) 5, we will denote its realization under an outcome 
w as S{w). For example, Xi{w) G A is the realization of element Xi for outcome w; and E*-{w) is the set 
of first 2^ elements completely scheduled by OPT (under w) along with their realizations. 
For any time t and corresponding outcome define a set function: 

y VI? C A. 

i£\m]Ji(st-i)<l 

We will also use f^*^^{D) to denote the term inside the above summation. 

It is easy to see that the function /^'^^ : 2^ 1R+ is submodular. We also define 

F^'-H^e) := E ^^nis,_,) [f''-'{Xe{w))] , VXe G A. (16) 
Observe that this is zero for elements X^ G sj-i. 
Proposition 7.5. Consider any time t G [n] and outcome sj_i. Note that Sf_i determines cr{t). Then: 



^Since elements have different sizes, note tliat a{t) is different from ■n[t) which is the tth element scheduled by ALG-AG-STO. 
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Proof. At some time t' < t (right after st-i is observed) ALG-AG-STO chose to schedule element ^CT(t) 
over all elements Xi G A \ sj_i. By the greedy rule we know that the claimed inequality holds for any 
Xi G A \ st-i- Furthermore, the inequality holds for any element Xi £ st_i, since F**"^ (Xi) = for such 
an element. □ 



We now define the expected gain by ALG-AG-STO in step t as: 

1 



Gt 



E 



St-l 



And the expected total gain: 



8a2-' 

E 



Gt 



(17) 



(18) 



We complete the proof of Lemma [741 by upper and lower bounding A-. 



Upper bound for Aj. Fix any outcome w £ Q,. Below, all variables are conditioned on w and hence they 
are all deterministic. (For ease of notation we do not write w in front of the variables). 

t=8a2J-^ "^^> t=8Q2J-i "^(^^ jeH:/i(st-i)<l 

8a2J 



^ E 7^ E /r-^(-^w) ^ Et^ E ft-(^. 

fi{Tk) — fi{Tk~i 



^ ^ ^ ^ 



E E 



1 - /.m-i) 



The first inequality uses the fact that any i ^ has /j already covered before time 8a V ^, and so it 
never contributes to Aj. In the last expression, := Xjr(fc)} ^ A, the first instantiations seen 

under w. The equality uses the fact that for each Yl^=i ^7r(i) < ^ ^ Sj=i ^■^U) ^^'^^ st-i = T^-i and 
(T(t) = k. Finally, by Claim [l!4l the contribution of each function /j € Rj-i is at most a := 1 + In i. Thus 
we obtain Aj{w) < a\Rj-i{w)\, and taking expectations, 

Aj < aE[\Rj^i\] (19) 



Lower bound for Aj. Consider any 8a2^^^ < t < 8a2^ . We will lower bound Gf. Condition on St_i; 
this determines a{t) (but not x^(t)). Note that Y17=i ' ^"^i^i ^ E*\st-i] < 2^ by definition of E* being 
the elements that ai^e completely scheduled by time 2^ in OPT. Hence we have 

§ • Pr[X,eE*\st.i] < 1. 
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By applying Proposition 17. 51 with the convex multipliers (over i) given above, 



2J 

= 1 Pr[w\st-i] Yl f'-'iMw)) (20) 

toGt^(st-i) X,GE*{w) 

The first equality is by definition of from ( fT6l ). The second equality holds since the optimal 

policy must decide whether to schedule Xj (by time 2^) without knowing the realization of Xi. Now for 
each w G il(sf_i), due to submodularity of the function /^^'^^ (•), we get 

Y r-'mw)) > r-^{E*{w)) 

^ ME*{w))-n{st-i) 

> |C;(t/;)|-|C7(t,t/;)| (21) 

Recall that Ej{w) denotes the set of elements scheduled by time 2^ in OPT(conditional on w), as well 
as the realizations of these elements. The equality comes from the definition of /'''"^ The last inequality 
holds because C{t,w) = {i £ [m] : fi{st-i) = 1} and set E*{w) covers functions C*{w). Combining 
(l20l ) and (EB gives: 

^ F^-H^.W) > I ] -n\C{t)\ I st-i ]). (22) 



By deconditioning the above inequality (taking expectation over st-i) and using ([TTl) . we derive: 



> - .^E[|C;|]-E[|C(t)|]j > - .(E[|C7;|]-E[|C,|]^, 

where the last inequality uses E[C(t)] is non-decreasing and t < 8a2K 
Now summing over all t G [8a2^~^, 8a2^) yields: 

8q2J 

A,- = Gt > 4a(E[|C;|]-E[|C,|]) = 4a(E[|i?,|] - E[|i?;|]) (23) 

t=8a2J-i 

Combining (1231 ) and ( fT9l ). we obtain: 

4a(E[\Rj\]-E[\R*\]) < aE[\Rj_^\] 
which simplifies to the desired inequality in Lemma 1741 □ 
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